104 research outputs found
Detecting highly overlapping community structure by greedy clique expansion
In complex networks it is common for each node to belong to several
communities, implying a highly overlapping community structure. Recent advances
in benchmarking indicate that existing community assignment algorithms that are
capable of detecting overlapping communities perform well only when the extent
of community overlap is kept to modest levels. To overcome this limitation, we
introduce a new community assignment algorithm called Greedy Clique Expansion
(GCE). The algorithm identifies distinct cliques as seeds and expands these
seeds by greedily optimizing a local fitness function. We perform extensive
benchmarks on synthetic data to demonstrate that GCE's good performance is
robust across diverse graph topologies. Significantly, GCE is the only
algorithm to perform well on these synthetic graphs, in which every node
belongs to multiple communities. Furthermore, when put to the task of
identifying functional modules in protein interaction data, and college dorm
assignments in Facebook friendship data, we find that GCE performs
competitively.Comment: 10 pages, 7 Figures. Implementation source and binaries available at
http://sites.google.com/site/greedycliqueexpansion
Seeding for pervasively overlapping communities
In some social and biological networks, the majority of nodes belong to
multiple communities. It has recently been shown that a number of the
algorithms that are designed to detect overlapping communities do not perform
well in such highly overlapping settings. Here, we consider one class of these
algorithms, those which optimize a local fitness measure, typically by using a
greedy heuristic to expand a seed into a community. We perform synthetic
benchmarks which indicate that an appropriate seeding strategy becomes
increasingly important as the extent of community overlap increases. We find
that distinct cliques provide the best seeds. We find further support for this
seeding strategy with benchmarks on a Facebook network and the yeast
interactome.Comment: 8 Page
Quantifying the extent to which index event biases influence large genetic association studies
This is the author accepted manuscript. The final version is available from the publisher via the DOI in this record.As genetic association studies increase in size to 100,000s of individuals, subtle biases may influence conclusions. One possible bias is "index event bias" (IEB) that appears due to the stratification by, or enrichment for, disease status when testing associations between genetic variants and a disease-associated trait. We aimed to test the extent to which IEB influences some known trait associations in a range of study designs and provide a statistical framework for assessing future associations. Analysing data from 113,203 non-diabetic UK Biobank participants, we observed three (near TCF7L2, CDKN2AB and CDKAL1) overestimated (BMI-decreasing) and one (near MTNR1B) underestimated (BMI-increasing) associations among 11 type 2 diabetes risk alleles (at P 500,000 if the prevalence of those diseases differs by > 10% from the background population. In conclusion, IEB may result in false positive or negative genetic associations in very large studies stratified or strongly enriched for/against disease cases.H.Y., A.R.W. and T.M.F. are supported by the European Research Council grant: 323195; SZ-245 50371-GLUCOSEGENES-FP7-IDEAS-ERC. S.E.J. is funded by the Medical Research Council (grant: MR/M005070/1). M.A.T., M.N.W. and A.M. are supported by the Wellcome Trust Institutional Strategic Support Award (WT097835MF). R.M.F. is a Sir Henry Dale Fellow (Wellcome Trust and Royal Society grant: 104150/Z/14/Z). R.B. is funded by the Wellcome Trust and Royal Society grant: 104150/Z/14/Z. J.T. is funded by a Diabetes Research and Wellness Foundation Fellowship. Z.K. received financial support from the Leenaards Foundation, the Swiss Institute of Bioinformatics and the Swiss National Science Foundation (31003A-143914) and SystemsX.ch (39). The work of M.P.B was supported by the National Heart, Lung, And Blood Institute of the National Institutes of Health under Award no. T32HL007779. Generation Scotland received core support from the Chief Scientist Office of the Scottish Government Health Directorates [CZD/16/6] and the Scottish Funding Council [HR03006]. E.R.P. holds a WT New investigator award 102820/Z/13/Z
CNV-association meta-analysis in 191,161 European adults reveals new loci associated with anthropometric traits
Funding Information: This research has been conducted using the UK Biobank Resource. This research has been conducted using the Danish National Biobank resource. The authors are grateful to the Raine Study participants and their families, and to the Raine Study research staff for cohort co-ordination and data collection. QIMR is grateful to the twins and their families for their generous participation in these studies. We would like to thank staff at the Queensland Institute of Medical Research: Anjali Henders, Dixie Statham, Lisa Bowdler, Ann Eldridge, and Marlene Grace for sample collection, processing and genotyping, Scott Gordon, Brian McEvoy, Belinda Cornes and Beben Benyamin for data QC and preparation, and David Smyth and Harry Beeby for IT support. HBCS Acknowledgements: We thank all study participants as well as everybody involved in the Helsinki Birth Cohort Study. Helsinki Birth Cohort Study has been supported by grants from the Academy of Finland, the Finnish Diabetes Research Society, Folkhälsan Research Foundation, Novo Nordisk Foundation, Finska Läkaresällskapet, Juho Vainio Foundation, Signe and Ane Gyllenberg Foundation, University of Helsinki, Ministry of Education, Ahokas Foundation, Emil Aaltonen Foundation. Finrisk study is grateful for the THL DNA laboratory for its skillful work to produce the DNA samples used in this study and thanks the Sanger Institute and FIMM genotyping facilities for genotyping the samples. We thank the MOLGENIS team and Genomics Coordination Center of the University Medical Center Groningen for software development and data management, in particular Marieke Bijlsma and Edith Adriaanse. This work was supported by the Leenards Foundation (to Z.K.), the Swiss National Science Foundation (31003A_169929 to Z.K., Sinergia grant CRSII33-133044 to AR), Simons Foundation (SFARI274424 to AR) and SystemsX.ch (51RTP0_151019 to Z.K.). A.R.W., H.Y. and T.M.F. are supported by the European Research Council grant: 323195:SZ-245. M.A.T., M.N.W. and An.M. are supported by the Wellcome Trust Institutional Strategic Support Award (WT097835MF). For full funding information of all participating cohorts see Supplementary Note 2. Publisher Copyright: © 2017 The Author(s).There are few examples of robust associations between rare copy number variants (CNVs) and complex continuous human traits. Here we present a large-scale CNV association meta-analysis on anthropometric traits in up to 191,161 adult samples from 26 cohorts. The study reveals five CNV associations at 1q21.1, 3q29, 7q11.23, 11p14.2, and 18q21.32 and confirms two known loci at 16p11.2 and 22q11.21, implicating at least one anthropometric trait. The discovered CNVs are recurrent and rare (0.01-0.2%), with large effects on height (> 2.4 cm), weight ( 5 kg), and body mass index (BMI) (> 3.5 kg/m(2)). Burden analysis shows a 0.41 cm decrease in height, a 0.003 increase in waist-to-hip ratio and increase in BMI by 0.14 kg/m2 for each Mb of total deletion burden (P = 2.5 x 10(-10), 6.0 x 10(-5), and 2.9 x 10(-3)). Our study provides evidence that the same genes (e.g., MC4R, FIBIN, and FMO5) harbor both common and rare variants affecting body size and that anthropometric traits share genetic loci with developmental and psychiatric disorders.Peer reviewe
- …